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PROCESSING SYSTEM AND METHOD FOR EFFICIENTLY 
ENABLING DETECTION OF DATA HAZARDS FOR 
LONG LATENCY INSTRUCTIONS 

\ 

BACKGROUND OF THE INVENTION 
FIELD OF THE INVENTION 

t 

The present invention generally relates to data processing techniques and, in 
particular, to a processing system and method for efficiently indicating when data 
produced by execution of one instruction is available for use in executing other 
instructions so that data hazards associated with the one instruction can be detected. 
The present invention is especially advantageous when the latency for producing the 
data of the one instruction is relatively long. 

r 

RELATED ART 

N To increase the performance of many processors, pipeline processing has been 
developed. In pipeline processing, a processor is equipped with at least one pipeline 
that can simultaneously process multiple instructions. Therefore, execution of one 
instruction in the pipeline may be commenced before the results of execution of a 
preceding instruction in the pipeline are available, thereby creating certain data 

j 

hazards. 

For example, a first instruction, when executed, may produce data and write 
the data to a particular register, and a second instruction, when executed, may utilize 
the data produced by the first instruction. If the second instruction executes before the 
data produced by the first instruction is available, then a data error occurs. 
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Consequently, a data hazard exists between the two instructions until the data 
produced by the first instruction is available. This type of data hazard is commonly 
referred to as a "read-after- write" (RAW) data hazard. 

In another situation, the second instruction, when executed, may write data to 
the same register as the first instruction, which commenced execution before the, 
second instruction. If the second instruction produces and writes its data to the N 
register before the first instruction, then a data error is possible, since the register will 
contain the data of the first instruction instead of the second instruction after both 
instructions are fully executed and processed. As a result, a data hazard exists 
between the two instructions until the data produced by the first instruction is written 
to the register. This type of data hazard is commonly referred to as a "write-after- 
write" (WAW) data hazard. 

To help prevent errors from the aforementioned data hazards and other types 
of data hazards, most superscalar processors include hazard detection circuitry that 
detects data hazards associated with the instructions being processed by the pipelines 
of the processor. The hazard detection circuitry usually detects the aforementioned 

r 

data hazards by detecting when a later instruction utilizes (e.g., writes to or reads 
from) the same register as an earlier instruction that has yet to produce available data. 

However, the data produced by a first instruction may not be available for use 
by other instructions or for writing to a particular register of a processor until well 
after the first instruction has been retired by the pipeline processing the first 
instruction (e.g., until well after the first instruction has exited the pipeline). For 
example, a load instruction, when executed, may generate a request to retrieve data 
from memory external to the processor. Depending on where the data is located in the 
memory hierarchy, it may take anywhere between a few clock cycles to several 
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thousand clock cycles for the data to be returned to the processor in response to the 
aforementioned request. Therefore, the processor may not receive the data produced 
by the first instruction until after the first instruction exits the processor's pipelines or 
is otherwise retired. In other words, the data produced by the first instruction does not 

y 

become available for use in executing other instructions until after the first instruction 
exits the processor's pipeline or is otherwise retired. An instruction, such as the 
aforementioned load instruction, that produces available data after the instruction has 
been retired by a pipeline shall be referred to herein as a "long latency instruction." 

To facilitate the process of detecting data hazards for long latency instructions, 
the processor is usually equipped with a register file, commonly referred to as a 
"scoreboard," that indicates when the processor has yet to receive data produced in 

r 

response to a previous execution of a producer (i.e., an instruction that produces data). 
In other words, the scoreboard indicates when there is a pending write to a general 

v. 

register. As used herein, the term "pending write" shall be used to refer to a situation 
in which data produced by an earlier executed instruction is destined for a general 
register but has yet to be written to the general register. As known in the art, a 
"general register" is one of a set of registers that are either written to via the execution 
of write instructions and/or are read from via the execution of read instructions. 
' The scoreboard includes a plurality of registers in which each register contains 

a bit value. Each scoreboard register and the bit value contained therein correspond to 
one of the general registers of the processor that receives data produced by the 
execution of write instructions. When a write instruction is retired before the data 
produced by the write instruction has been written to the general register that is to 
receive the data, the bit value in the scoreboard register that corresponds to the general 
register is asserted. Furthermore, when the data produced by the write instruction is 
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finally written to the general register, the aforementioned bit value is deasserted. 
Therefore, to determine whether there is a pending write to one of the general 
registers, the bit value in the scoreboard register corresponding to the one general 
register can be analyzed. 

V 

Each asserted bit value in the scoreboard indicates that there is a pending write 
to the general register corresponding to the asserted bit value. Therefore, any , 
instruction being processed by the processor should be prevented from writing to or 
reading from the foregoing general register to prevent errors from RAW or WAW 
data hazards, until at least the pending write expires (i.e., until the data destined for 
the general register is actually written to the general register). Furthermore, each 
deasserted bit value in the scoreboard indicates there is presently no pending write to 
the general register corresponding to the deasserted bit value. Therefore, an 
instruction being processed by the processor may read from or write to the foregoing 

/ * 

general register without creating any errors from RAW or WAW data hazards. 

When a RAW or WAW data hazard is detected by analyzing the scoreboard or 
otherwise, errors from the RAW or WAW data hazard are often prevented by stalling 
one or more instructions being processed by the processor. U.S. Patent Application 
entitled "Superscalar Processing System and Method for Efficiently Performing In 
Order Processing of Instructions," assigned serial no. 09/390,199, and filed on 
September 7, 1999, which is incorporated herein by reference, describes in more detail 
a process of stalling instructions to prevent data hazards. When an instruction is 

r 

stalled, the processor temporarily refrains from further processing the instruction until 
the data hazard causing the stall expires. 

Unfortunately, the wiring and other circuitry typically used to write to and read 
from the scoreboard is relatively complex and often utilizes a relatively large amount 
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of surface area within a processor. To minimize the cost of the processor, it is 
desirable for the circuitry required to detect data hazards to be minimized. In 

> 

particular, it is desirable to minimize and simplify the circuitry required to write to 
and read from the scoreboard. 
5 Thus, a heretofore unaddressed need exists in the industry for providing a 

efficient system and method of utilizing a scoreboard to detect data hazards. 

SUMMARY OF THE INVENTION 

The present invention overcomes the inadequacies and deficiencies of the prior 

r 

1 

10 art as discussed hereinbefore. Generally, the present invention provides a system and 
method for executing instructions of a computer program and for efficiently indicating 
the existences of pending writes to general registers that are used in executing the 
instructions. 

,-• * 

In architecture, the processing system of the present invention utilizes a 
15 plurality of general registers, a plurality of connections, a pipeline, a scoreboard, and 

J 

hazard detection circuitry. The plurality of connections corresponds respectively with 
the general registers. The scoreboard maintains a plurality of bits such that each bit ' 
indicates whether there is a pending write to a corresponding register. The scoreboard 
transmits to the hazard detection circuitry one of the bits that is indicative of whether a 
20 pending write to the one general register exists based on a value of the one bit and 
. based on which of the connections is used to transmit the one bit. The hazard 
detection circuitry then detects whether a data hazard exists based on the one bit. 

In accordance with another feature of the present invention, a decoder is used 

i 

i 

to decode a register identifier associated with data being written to one of the general 
25 registers. The decoder transmits the register identifier to both the scoreboard and the 

5 
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hazard detection circuitry. The hazard detection circuitry may use the register 
identifier to detect a data hazard, and the scoreboard may use the register identifier to 
modify one of the bits in the scoreboard. 

The present invention can also be viewed as providing a method for processing 
instructions of a computer program. The method can be broadly conceptualized by 
the following steps: providing a plurality of general registers; maintaining a plurality 
of bits, each of the bits respectively corresponding with one of the general registers; 
providing a plurality of connections, each of the connections respectively 
corresponding with one of the general registers; indicating, via the bits, which of the 
general registers are associated with pending writes; transmitting one of the bits 
corresponding with a particular one of the general registers across a particular one of 
the connections, the one connection corresponding with the particular one general 
register; and detecting a data hazard based on the one bit transmitted across the 
particular one connection. 

Other features and advantages of the present invention will become apparent to 
one skilled in the art upon examination of the following detailed description, when 
read in conjunction with the accompanying drawings. It is intended that all such 
features and advantages be included herein within the scope of the present invention 
and protected by the claims. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The invention can be better understood with reference to the following 
drawings. The elements of the drawings are not necessarily to scale relative to each 
other, emphasis instead being placed upon clearly illustrating the principles of the 
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invention. Furthermore, like reference numerals designate corresponding parts 
throughout the several views. 

FIG. 1 is a block diagram illustrating a conventional processing system. 

i 

FIG. 2 is a block diagram illustrating a scoreboard of the processing system 
depicted in FIG. 1 . 

FIG. 3 is a block diagram illustrating a computer system employing a 
processing system in accordance with the present invention. 

FIG. 4 is a block diagram illustrating the processing system depicted by FIG. 
3. ' ' ■ 

FIG. 5 is a block diagram illustrating a scoreboard of the processing system 
depicted in FIG. 4. 

FIG. 6 is a block diagram illustrating circuitry that may be used in 
implementing hazard detection circuitry depicted in FIG. 4. 

r 

\ 

\ 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

The present invention generally relates to a processing system that efficiently 
detects data hazards for long latency instructions with a minimal amount of circuitry 
and complexity. To better illustrate the principles of the present invention, first refer 
to FIG. 1, which depicts a conventional superscalar processing system 15. The 
processing system 15 includes an instruction dispersal unit 18 that receives 
instructions of a computer program and assigns each instruction to one of a plurality 
of pipelines 21. Each pipeline 21 is configured to process and execute, if appropriate, 
each instruction assigned to the pipeline 21. 

Each pipeline 21 is usually configured to only process particular types of 
instructions (e.g., integer operation, floating point operation, memory operation, etc.). 
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Therefore, the instruction dispersal unit 18 is configured to assign each instruction 
only to a pipeline 21 compatible with the instruction. Furthermore, although control 
circuitry 24, hazard detection circuitry 25, and register file 27, which is often referred 
to as a "scoreboard," are shown for simplicity as being coupled to one pipeline 2 1 in 
FIG. 1, it should be noted that each pipeline 21 is similarly coupled to the control 

s 

circuitry 24, the hazard detection circuitry 25, and the scoreboard 27. However, if 
desired, the connections 31, which couple the one pipeline 21 to a write port of the 
scoreboard 27, may be removed for any pipeline 21 that does not process a long 
latency instruction. 

r 

It is well known that data is often written to and read from general registers 29 
associated with the processing system 1 5 in executing the instructions received by the 
processing system 15. Furthermore, when a general register 29 is utilized {i.e., written 
to or read from) in executing an instruction, the instruction usually includes a register 
identifier that identifies the general register 29. The register identifier is usually an n- 
bit encoded binary value that is unique to the general register 29 identified by the 
register identifier. 

The hazard detection circuitry 25 is configured to receive and compare the 
register identifiers included in the instructions being processed by the pipelines 21 to 

i 

i 

determine if any data hazards, such as RAW and/or WAW data hazards, exist between 
the instructions processed by the pipelines 21 . If the hazard detection circuitry 25 
detects a data hazard, the hazard detection circuitry 25 transmits a notification signal 
to the control circuitry 24 indicating which instruction in the pipelines 21 is associated 

with the data hazard. In response, the control circuitry 24 may transmit a control 

\ 

signal to one or more pipelines 21 to cause the one or more pipelines 21 to stall (i.e., 
temporarily prevent further processing of instructions) in order to prevent a data error 

8 
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associated with the data hazard. Once the data hazard expires, the control circuitry 24 
enables the processing of the stalled instruction or instructions to resume. 

i ■ 

Execution of some instructions causes the retrieval of data from external 

i 

registers or locations in memory outside of the processing system 15. When such data 
5 is received by the processing system 15, the processing system 15 usually writes the 
data to a general register 29 identified by the instruction that retrieved the data. 
Usually, the data is then utilized to execute a later instruction. 

The time required to retrieve the data varies depending on where the data is 
located, and the instruction that requests the retrieval of the data is often retired before 
10 the data is received by the processing system 15. However, once an instruction is 

retired and has exited the pipelines 21, the hazard detection circuitry 25 usually can no 
longer detect a RAW or a WAW data hazard associated with the instruction by only 
comparing register identifiers of the instructions in the pipelines 21, even though such 

f r 

.S 

a data hazard may exist until the retrieved data is at least received by the processing 
15 system 15. 

Therefore, additional steps are usually taken to detect data hazards associated 

i 

with instructions that have been retired. For example, in the embodiment shown by 
FIG. 1, if a pending write is caused by a producer that is about to be retired, then the 
register identifier included in the producer and associated with the pending write {i.e., 
20 ^identifying the general register 29 that is to be written to) is transmitted to scoreboard 
27 via a set of connections 31. It should be noted that, for simplicity, FIG. 1 shows 

i 

only one set of connections 31 coupling one of the pipelines 21 to the scoreboard 27. 
However, there are typically multiple sets of connections 3 1 that respectively couple 
multiple pipelines 21 to different write ports of the scoreboard 27. Each pipeline 21 
25 that processes long latency instructions should be coupled to the scoreboard 27 in the 
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manner that the set of connections 3 1 couples one of the pipelines 21 to the 

scoreboard 27 in FIG. 1, although more pipelines 21 may be similarly coupled to the 

i 

scoreboard 27, if desired v 

The scoreboard 27 includes a plurality of registers 32, as shown by FIG. 2. 
5 Each of the scoreboard registers 32 respectively corresponds to a different one of the 



general registers 29 (FIG. 1) and contains a bit value. The scoreboard 27 includes a 



write port 34a coupled to connections 3 1 . The write port 34a receives the 



aforementioned register identifier included in the producer associated with the 



pending write and asserts the bit value in the scoreboard register 32 corresponding to 



10 the general register 29 identified by the received register identifier. The register 



identifier is often encoded, and the write port 34a, therefore, decodes the register 



identifier before asserting the bit in the appropriate scoreboard register 32. 



Referring again to FIG. 1, the data to be written to a general register 29 
associated with a pending write is usually received by an interface 42 along with a 



15 register identifier that identifies the general register 29. The interface 42 transmits the 



foregoing data to the identified general register 29 and transmits the register identifier 



to the scoreboard 27 via connections 45a. The scoreboard 27 includes a write port 



34b (FIG. 2) coupled to connections 45a that receives the register identifier from 



connections 45a and deasserts the bit value in the scoreboard register 32 



20 corresponding to the general register 29 identified by the register identifier. 



Since the interface 42 may receive multiple register identifiers in a single 



cycle, the system 15, as shown by FIGS. 1 and 2, may include multiple sets of other 



connections, such as connections 45b, and multiple other write ports, such as write 



port 34c, to respectively transmit and receive each of the register identifiers received 



25 by interface 42 to scoreboard 27. Furthermore, the scoreboard 27 usually includes 



10 
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multiple write ports 34a that are respectively coupled to different pipelines 21 for 
receiving register identifiers from the different pipelines 21 . Therefore, the number of 
write ports 34a, 34b, and 34c can become quite large and the circuitry for interfacing 
the write ports 34a, 34b, and 34c with the registers 32 can, therefore, be complicated 
5 and expensive. 

By asserting and deasserting the bit values in the registers 32, as described 
above, the hazard detection circuitry 25 can detect data hazards associated with retired 
instructions based on the bits contained in the registers 32 of the scoreboard 27. In 
this regard, if a bit contained in a scoreboard register 32 that corresponds to a 
10 particular one of the registers 29 is asserted, then there exists a pending write to the 
one general register 29. Consequently, a RAW or WAW data hazard may exist, if 

,1 • 

another instruction being processed by the pipelines 21 utilizes data from the one 

/ 

i 

general register 29. However, if the bit contained in the foregoing register 32 is 
deasserted, then there is presently no pending write to the one general register 32 by a 
15 retired instruction. Therefore, a RAW or WAW data hazard associated with a retired 
instruction that writes to the one general register 29 should not exist. 

Each pipeline 21 is usually coupled to one read port 48a, as shown by FIGS. 1 
and 2 for one of the pipelines 21. To detect a data hazard between a pending 

i 

instruction in a pipeline 21 and a previously retired instruction, the pipeline 21 usually 
20 transmits via connections 49 an encoded n-bit register identifier identifying a general 
register 29 that is to be used in executing the pending instruction. The read port 48a 
receives and decodes the register identifier. The read port 48a also analyzes the bit 
contained in the scoreboard register 32 that corresponds with the general register 29 
identified by the register identifier. If the bit in the foregoing scoreboard register 32 
25 indicates that there is a pending write to the general register 29 identified by the 
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register identifier, the read port 48a transmits, via connection 52, a signal indicating 
that a data hazard associated with the foregoing instruction may exist. For example, if 
the bit in the scoreboard register 32 analyzed by the read port 48a is asserted, then the 
read port 48a transmits an asserted one bit signal. Otherwise, the read port 48a 

5 transmits a deasserted one bit signal. Based on the signal transmitted across 

connection 52, the hazard detection circuitry 25 determines whether or not a data 
hazard associated with the pending instruction exists. 

Note that there is a finite amount of delay associated with writing to and 
reading from the scoreboard 27. Therefore, to quickly provide the hazard detection 

10 circuitry 25 with updates, the register identifiers transmitted to the scoreboard 27 via 
the interface 42 also bypass the scoreboard 27 via bypass connections 54a and/or 54b. 
In certain situations, the hazard detection circuitry 25 may utilize the register 
identifiers from bypass connections 54a and/or 54b to prevent or remove a stall on an 
instruction in one of the pipelines 21 . 

15 Furthermore, each instruction may utilize more than one general register 29 

when executed and, therefore, may contain more than one register identifier. A 
separate set of connections 49 and 52 and a separate read port 48a are usually 
provided for each possible register identifier. For example, if the architecture of the 
system 15 is such that each instruction in each pipeline 21 may include up to three 

r 

20 register identifiers, then there is usually at least three read ports 48a, 48b, and 48c 
coupled to each pipeline 21 via three sets of connections 49. Therefore, as can be 
appreciated by one skilled in the art, the number of read ports 48a, 48b, and 48c and 
connections 49 and 52 in the system 15 can be quite large for many superscalar 
processors, thereby increasing the amount of wiring and the complexity of the system 

25 15. To reduce the complexity and cost of the system 15 and to conserve valuable 

12 
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space in the system 15, it is desirable to minimize the circuitry utilized in writing to 
and reading from the scoreboard 27, including the number of write ports 34a, 34b, 
and/or 34c and read ports 48a, 48b, and/or 48c. 

In general, the present invention provides a processing system that efficiently 
and with a minimal amount of circuitry and complexity tracks pending writes caused 
by retired instructions. FIG. 3 depicts a processing system 50 in accordance with the 
principles of the preferred embodiment of the present invention. As shown by FIG. 3, 
the processing system 50 may be employed within a computer system 55 for executing 
instructions from a computer program 57 that is stored in memory 59. 

The processing system 50 communicates to and drives the other elements within 
the system 55 via a local interface 62, which can include one or more buses. 
Furthermore, an input device 64, for example, a keyboard or a mouse, can be used to 
input data from a user of the system 55, and screen display 66 or a printer 68 can be used 
to output data to the user. A disk storage mechanism 71 can be connected to the local 
interface 62 to transfer data to and from a nonvolatile disk (e.g., magnetic, optical, etc.). 
The system 55 can be connected to a network interface 73 that allows the system 5,5 to 
exchange data with a network 75. 

Similar to conventional system 15, processing system 50 includes at least one 
pipeline 21, control circuitry 24, hazard detection circuitry 81, and a scoreboard 82, as 
shown by FIG. 4. Although only one pipeline 21 is coupled to the control circuitry 24 
and the hazard detection circuitry 81 in FIG. 4, the other pipelines 21 are similarly 
coupled to the control circuitry 24 and the hazard detection circuitry 8 1 in the preferred 
embodiment. Furthermore, FIG. 4 shows one pipeline 21 coupled to the scoreboard 82 
via connections 83 and a decoder 84. Each pipeline 21 that processes long latency 

13 
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instructions is similarly coupled to the scoreboard 82 in the preferred embodiment, 
although other pipelines 21 can be similarly coupled to the scoreboard 82, if desired. 

Similar to conventional hazard detection circuitry 25, the hazard detection 
circuitry 81 of the preferred embodiment detects data hazards associated with the 
instructions being processed by the pipelines 21, based on the register identifiers of the 
foregoing instructions. To this end, the register identifier(s) of each instruction in the 
pipelines 21 is transmitted to the hazard detection circuitry 81, which compares the 
register identifiers to detect data hazards. However, each n-bit encoded register 
identifier transmitted to the hazard detection circuitry 81 is first decoded by a decoder 84 
into an m-bit register identifier, where m corresponds to the number of general registers 
29 used by the system 50 to write and read data produced by the execution of 
instructions. ' 

In this regard, each bit of an m-bit decoded register identifier corresponds to a 
particular general register 29. When a decoder 84 receives an n-bit register identifier 
that identifies one of the general registers 29, the decoder 84 is configured to assert the 
bit in the m-bit register identifier that corresponds to the one general register 29 and to 
deassert the remaining bits in the m-bit register identifier. Therefore, by detecting which 
bit in the m-bit decoded register identifier is asserted, it can be determined which 
general register 29 is identified by the m-bit decoded register identifier. Commonly 
assigned U.S. Patent Application entitled "System and Method for Efficiently Detecting 
Data Hazards Between Instructions of a Computer Program/' (atty. docket no. 
1 097 1 1 85), and commonly assigned U.S. Patent Application entitled "System and 
Method for Utilizing Instruction Attributes to Detect Data Hazards," (atty. docket no. 
10971333), which are both incorporated herein by reference, describe in more detail 
circuitry that may be used to implement the decoders 84 and circuitry that may be used 

14 
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by hazard detection circuitry 81 to detect when the register identifiers of two instructions 
match and, therefore, when a data hazard may exist between the two instructions. 

Similar to conventional scoreboard 27, the scoreboard 82 includes a plurality of 

registers 32, as shown by FIG. 5, in which each scoreboard register 32 and, therefore, the 

j 

bit value contained therein correspond to a particular general register 29. When 
asserted, a bit in a scoreboard register 32 indicates that there is a pending write by a 
retired instruction to the corresponding general register 29. When deasserted, a bit in a 
scoreboard register 32 indicates that no such pending write exists. 

Therefore, the scoreboard 82, similar to conventional scoreboard 27, indicates 
which general registers 29 are associated with pending writes caused by retired 
instructions. However, as previously indicated, scoreboard 27 via write ports 34a, 34b, 
and 34c typically decodes the register identifiers transmitted across connections 31, 45a, 

t * 

and 45b. In the preferred embodiment, the scoreboard 82 is coupled to at least one of 
the decoders 84 via connections 83, as shown by FIG. 5. Therefore, the foregoing 
decoder 84 provides an m-bit decoded register identifier to both the hazard detection 
circuitry 81 and the scoreboard 82. Since the register identifier transmitted by 
connections 83 has m bits, each of the connections 83 can be coupled directly to the 
registers 32, thereby eliminating the need of a write port 34a (FIG. 2) in the scoreboard 
82. 

In this regard, each bit transmitted across one of the connections 83 is provided 
to the scoreboard register 32 that corresponds to the same general register 29 as the 
foregoing bit. In other words, when the m-bit register identifier transmitted across 
connections 83 identifies a particular one of the general registers 29, only the scoreboard 
register 32 corresponding to the one general register 29 should receive an asserted bit, 
and the other scoreboard registers 32 should receive a deasserted bit. As a result, there 
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is no need for a write port 34a (FIG. 2) in the scoreboard 82 to detect which scoreboard 
register 32 corresponds to the general register 29 identified by the register identifier 
received by the scoreboard 82. Instead, the same decoder 84 that provides an m-bit 
register identifier to hazard detection circuitry 81 may also be used to provide the 
appropriate bit values to the appropriate registers 32. Accordingly, the amount and 
complexity of the circuitry required to detect data hazards and to appropriately assert the 
bits in the scoreboard 82 are minimized in the preferred embodiment. 

In addition, the register identifiers received from connections 45a and 45b are 
preferably decoded by a decoder 85 instead of write ports 34b and 34c (FIG. 2) in the 
scoreboard 82. In this regard, the decoder 85 is configured to receive the n-bit encoded 
register identifiers presently transmitted from interface 42 and to output an m-bit 
decoded register identifier identifying each of the general registers 29 identified by the 
n-bit register identifiers that are received in the same cycle. 

- 1 

In the preferred embodiment, the value m corresponds to the number of general 
registers 29 in the processing system 50, and each bit of the m-bit decoded register 
identifier transmitted by decoder 85 corresponds to a different general register 29. The 
decoder 85 is configured to assert each bit in the m-bit decoded register identifier that 

\ 

corresponds to a general register 29 identified by one of the n-bit register identifiers 
received in the same cycle and to deassert the remaining bits of the m-bit decoded 
register identifier. Therefore, a general register 29 is identified by the m-bit decoded 
register identifier transmitted by decoder 85 when the corresponding bit in the decoded 

s 

register identifier is asserted. 

When the scoreboard 82 receives a new decoded register identifier from the 
decoder 85, the scoreboard 82 is designed to deassert the bits in the scoreboard registers 

16 
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32 corresponding to the general registers 29 identified by the decoded register identifier, 
since the pending writes to the foregoing general registers 29 should no longer exist. 

In this regard, each scoreboard register 32 receives a bit of the decoded register 
identifier from decoder 85. The scoreboard register 32 preferably corresponds to the 
same general register 29 as the bit received by the scoreboard register 32. In other 

\ 

words, when the decoded register identifier from decoder 85 identifies one of the general 
registers 29, the scoreboard register 32 corresponding to the identified general register 
29 should receive an asserted bit from the decoder 85. Moreover, when the value of a 
bit received from decoder 85 is asserted, the scoreboard register 32 receiving the 

i 

asserted bit value is designed to clear (e.g., to deassert) the bit value contained in the 
register 32. As a result, an asserted bit in a scoreboard register 32 corresponding to a 
particular one of the general registers 29 indicates that there is presently a pending write 
to the one general register 29, and a deasserted bit in the scoreboard register 32 indicates 
that there is presently no such pending write. 

When desired, the scoreboard 82 transmits, via connections 88, to hazard 
detection circuitry 81 a signal corresponding to the data contained in the scoreboard 
registers 32 so that the hazard detection circuitry 81 can properly detect data hazards, 

/ 

such as RAW and WAW data hazards. In the preferred embodiment, each of the 
connections 88 is coupled to a different one of the scoreboard registers 32 and, therefore, 
corresponds to the same general register 29 as the one scoreboard register 32 coupled to 
the connection 88. Therefore, the signal transmitted to hazard detection circuitry 81 is 
an m-bit data word, and each bit of the m-bit data word corresponds to a bit value from a 
different scoreboard register 32. As a result, each asserted bit in the m-bit data word 
indicates that there is a pending write to a particular one of the general registers 29. The 
m-bit data word is essentially an m-bit register identifier that may simultaneously 

17 
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identify more than one of the general registers 29. Since each of the scoreboard registers 
32 is coupled directly to the hazard detection circuitry 83 via one of the connections 88 
in the preferred embodiment, the read ports 48a, 48, and 48c (FIG. 2) in conventional 
scoreboard 27 can be eliminated. 
5 However, since there is a finite amount of delay in writing to and reading from 

the scoreboard 27, the data transmitted by the scoreboard 82 is not always updated. As a 
result, it is possible for the data word transmitted across connection 88 to indicate that a 
pending write exists, when the pending write has, in fact, recently expired. To provide 

K 

the hazard detection circuitry 81 with most up-to-date information available, a bypass 
10 connection 94 that bypasses the scoreboard 82 provides the decoded m-bit register 

identifier from decoder 85 to the hazard detection circuitry 81, similar to bypass 

connections 54a and 54b of conventional system 15. 

The hazard detection circuitry 81 detects data hazards by comparing the m-bit 

register identifiers received from the pipelines 21 via the decoders 84 to the m-bit 
15 register identifier received from connections 88 and/or 94, FIG. 6 shows an exemplary 

■ 

implementation of circuitry 101 that may be used to detect a data hazard by comparing 
an m-bit register identifier received from the pipelines 21 to the m-bit register identifier 
received from connections 88 or 94. 

Each AND gate 104 receives a bit of the m-bit register identifier from the 

20 pipeline 21 and a bit of the m-bit register identifier from the connections 88 or 94. 
Furthermore, each bit received by the same AND gate 1 04 corresponds to the same 
general register 29. Therefore, one of the AND gates 104 should produce an asserted 
output only when the m-bit register identifier from the pipeline 21 identifies one of the 
general registers 29 identified by the m-bit register identifier from connections 88 or 94. 

25 As a result, if one of the AND gates 104 produces an asserted output, then the 

* 
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instruction associated with the m-bit register identifier from the pipeline 21 utilizes a 



general register 29 subject to a pending write, and a data hazard associated with the. 



foregoing instruction may, therefore, exist. 



Consequently, when one of the AND gates 104 produces an asserted output, the 
5 hazard detection circuitry 8 1 detects a data hazard associated with the instruction 



associated with the m-bit register identifier received from the pipeline 21 . However, if 



none of the AND gates 104 produces an asserted output, then there should be no 



pending write associated with the general register 29 identified by the m-bit register 



identifier received from the pipeline 21 . Therefore, there should be no pending write to 



10 the general register 29 utilized by the instruction associated with the m-bit register 



identifier from the pipeline 2 1 , and the hazard detection circuitry 8 1 fails to detect a data 



hazard associated with the foregoing instruction. 



According to the foregoing techniques, the hazard detection circuitry 81 may 



detect a data hazard and transmit a signal to control circuitry 24 indicating that a data 



15 hazard associated with one of the instructions in the pipelines 21 exists. In response, the 



control circuitry 24, in order to prevent a data error from the detected data hazard, may 



transmit a control signal to the pipeline 21 processing the foregoing instruction 



indicating that the foregoing instruction should be stalled. In response, the pipeline 21 



preferably stalls the instruction. 



20 , By later comparing the m-bit register identifier associated with the foregoing 



instruction with the m-bit register identifier received from connections 88 or 94, the 



hazard detection circuitry 81 may detect that the aforementioned data hazard no longer 



exists. In this regard, the data destined for the general register 29 utilized by the 



instruction may have been received by the processing system 50, thereby causing the 



25 aforementioned data hazard to expire. In response to the failure of the hazard detection 
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circuitry 81 to detect a data hazard associated with the aforementioned instruction, the 

-v ■ 

I 

control circuitry 24 transmits a control signal to the pipeline 21 processing the 
instruction indicating that the instruction should not be stalled. In response, the pipeline 
21 removes the stall on the instruction, and processing of the instruction by the pipeline 
21 resumes. - 

It should be noted that the decoder 85 is not a necessary feature of the present 
invention. In this regard, it is possible for the scoreboard 82 to include circuitry for 
decoding, if necessary, the register identifiers received from connections 45 a and/or 45b, 
and the register identifiers transmitted across connections 45a and/or 45b may be 
transmitted directly to hazard detection circuitry 81, similar to conventional system 15. 
In such an embodiment, the hazard detection circuitry 8 1 may include decoders for 
appropriately decoding the register identifiers transmitted across connections 45a and/or 
45b, if necessary. 

Furthermore, for simplicity, the systems 15 and 50 have been shown herein as 
containing only four general registers 29. Therefore, the value of m preferably equals 
four in the embodiments shown by the drawings. However, the system 50 can include 
any number of registers 29 in other embodiments, and the value of m, therefore, can be 
other integer values in other embodiments. Furthermore, it is possible for the register 
identifiers to be decoded or encoded into data words with lengths greater than or less 
than m, although such embodiments are usually less efficient than the preferred 
embodiment. In this regard, providing the hazard detection circuitry 81 with m-bit 
register identifiers not only may reduce the circuitry of the system 50, but may also 
simplify and/or enable better detection of data hazards by data hazard circuitry 81. 

In addition, as can be seen by comparing FIG. 5 to FIG. 2, the configuration of 
the preferred embodiment eliminates the need of including read ports 48a, 48b, and 48c 
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in the scoreboard 82. Furthermore, one of the decoders 84 may efficiently perform the 
functionality of decoding a register identifier that is to be transmitted to both the 

■ 

scoreboard 82 and the hazard detection circuitry 81. As a result, the amount of circuitry 
needed to implement the system 50 is minimized in the preferred embodiment, thereby 
minimizing the cost and complexity of the system 50. 

OPERATION 

The preferred use and operation of the processing system 50 and associated 
methodology are described hereafter. 

Assume that in processing the instructions of computer program 57 (FIG. 3), 
the processing system 50 (FIG. 4) receives a first instruction that, when executed, 
causes data (hereinafter referred to as the "retrieved data") to be retrieved from 
memory 59 and stored in a general register 29 of the processing system 50. Also, 
assume that the processing system 50 later receives a second instruction from the 
program 57 that, when executed, utilizes the retrieved data. Since the aforementioned 
general register 29 is utilized to execute both instructions, the first instruction and the 
second instruction both include a register identifier identifying the aforementioned 
general register 29. 

While the first instruction and the second instruction are simultaneously 
processed by the pipelines 21, the hazard detection circuitry 81 can detect a RAW data 
hazard between the two instructions by comparing the decoded register identifiers of 
the two instructions. At some point, the control circuitry 24 may cause the second 
instruction to be stalled to prevent an error associated with the RAW data hazard. If 
the first instruction is retired before the retrieved data is received by processing system 
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50, then the RAW hazard may exist between the first instruction and second 
instruction, even though the first instruction is retired. 

To enable the hazard detection circuitry 81 to detect the RAW hazard after 
first instruction is retired, an m-bit register identifier identifying the general register 
29 that is to receive the retrieved data is transmitted to scoreboard 82 via connections 
83, which are coupled to the pipeline 21 processing the first instruction. In response, 
the scoreboard 82 asserts the bit in the scoreboard register 32 (FIG. 5) that 
corresponds with the general register 29 identified by the foregoing register identifier 
(i.e., the general register 29 that is to receive the retrieved data). 

i 

Based on the bit value contained in the foregoing scoreboard register 32 and 
the register identifier of the second instruction, the hazard detection circuitry 8 1 may 
detect the aforementioned RAW data hazard, even though the first instruction has 
been retired. In response, the control circuitry 24 may then cause the second 
instruction to be stalled. 

When the retrieved data is received by the interface 42, the register identifier 

< — - 

identifying the general register 29 to receive the retrieved data is transmitted to 
decoder 85 via connections 45a or 45b. In response to the register identifier, the 
decoder 85 produces an m-bit decoded register identifier that identifies at least the 
foregoing general register 29. In this regard, the bit in the decoded register identifier 
corresponding to the foregoing general register 29 is asserted. This m-bit register 
identifier is then transmitted to scoreboard 82, which deasserts the bit in the 
scoreboard register 32 (FIG. 5) corresponding to the foregoing general register 29 in 
response to the asserted bit in the decoded register identifier. The m-bit register 
identifier transmitted to scoreboard 82 is also transmitted to hazard detection circuitry 

81 via bypass connection 94. 

/ i 
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Based on either the deasserted bit value in the aforementioned scoreboard 
register 32 or the foregoing m-bit register identifier transmitted across bypass 
connection 94, the hazard detection circuitry 81 detects that the RAW hazard no 
longer exists. In response, the control circuitry 24 removes the stall on the second 
instruction, and processing of the second instruction resumes without causing a data 
error associated with the aforementioned RAW data hazard. 

Although the present invention has been described herein as utilizing the 
scoreboard 82 to indicate when data produced by long latency instructions, is 
unavailable, it should be noted that the scoreboard 82 may be similarly used to 
indicate when data produced by any type of instruction is unavailable. In particular, 
the scoreboard 82 may be used to indicate whether data produced by pending 
instructions (i.e., instructions that have yet to be retired and are still being processed 
by the pipelines 21) is unavailable. Furthermore, although the present invention has 
been described herein in some examples as utilizing the scoreboard 82 to indicate 
whether data produced by a load instruction is unavailable, the present invention 
should not be so limited, and the scoreboard 82 may also be used to indicate whether 
data produced by any other type of instruction is unavailable. 

It should be emphasized that the above-described embodiments of the present 
invention, particularly, any "preferred" embodiments, are merely possible examples of 

i 

implementations, merely set forth for a clear understanding of the principles of the 
invention. Many variations and modifications may be made to the above-described 
embodiment(s) of the invention without departing substantially from the spirit and . 
principles of the invention. All such modifications and variations are intended to be 

w 

included herein within the scope of this disclosure and the present invention and 
protected by the following claims. 
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